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PRELIMINARY AMENDMENT 



Honorable Commissioner for Patents 
BOX PCT 

Washington, D.C. 20231 
Dear Sir: 

Kindly amend the subject application as follows: 
IN THE SPECIFICATION: 



Please insert the paragraph heading on page 1 of the English translation of 
the subject application, before line 7, as follows: 

-- Technical Field ~. 
Please insert the paragraph heading on page 1 of the English translation of 
the subject application, line 10, as follows: 

—Background Art ~. 
Please insert the paragraph heading on page 2 of the English translation of 
the Annex to Form PCT/IPEA/409, before line 30, as follows: 

— Summary of the Invention --. 
Please insert the paragraph heading on page 4 of the English translation of 
the subject application, before line 1, as follows: 

—Brief Description of the Drawings -- . 
Please insert the paragraph heading on page 4 of the English translation of 
the subject application, line 30. as follows: 

- Detailed Description of the Invention -. 
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The Commissioner is hereby authorized to charge any deficiencies or credit 
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Account No. 50-0426. 
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IN THE SPECIFICATION: 

The paragraph heading has been inserted on page 1 of the English translation 
of the subject application, before line 7, as follows: 

Technical Field 

The paragraph heading has been inserted on page 1 of the English translation 
of the subject application, line 10, as follows: 

Background Art 

The paragraph heading has been inserted on page 2 of the English translation 
of the Annex to Form PCT/IPEA/409, before line 30, as follows: 

Summary of the Invention 
The paragraph heading has been inserted on page 4 of the English translation 
of the subject application, before line 1, as follows: 

Brief Description of the Drawings 
The paragraph heading has been inserted on page 4 of the English translation 
of the subject application, line 30, as follows: 

Detailed Description of the Invention 
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Patent claims 

1. Method for processing conditional jump 
instructions in a processor with pipeline computer 
architecture, that has the following steps: 

a) loading and decoding a processor instruction, the 
processor instruction containing an instruction 
opcode, register addresses, a relative jump 
distance, a precondition and a post-condition, 

b) execution of the decoded processor instruction if 
the precondition is fulfilled, and 

c) jumping to a jump address as a function of the 
relative jump distance contained in the processor 
instruction if the post-condition is fulfilled. 

2. The method as claimed in claim 1, in which the 
post-condition comprises a plurality of post- 
condition bits that are checked in the processor. 

3. An apparatus for processing conditional jump 
instructions in a processor with pipeline computer 
architecture , having : 

an instruction decoder (20) for decoding a 
processor instruction that contains an instruction 
opcode, register addresses, a relative jump 
distance, a precondition and a post-condition, 

the instruction decoder (20) checking in the case 
of a fulfilled precondition whether the post- 
condition is fulfilled and, in the case of a 
fulfilled post -condition, driving a program 
counter (14) for forming a jump address as a 
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function of the relative jump distance contained 
in the processor instruction. 
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Description 

Method and apparatus for processing conditional jump 
5 instructions in a processor with pipelined architecture 

The present invention relates to a method and an 
apparatus for processing conditional jump instructions 
in a processor with pipelined architecture. 

The number of cycles required for executing specific 
instructions is one of the most important performance 
parameters of a processor. The number of cycles is to 
be minimized as far as possible in order to achieve 
maximum processing speed and minimum power consumption. 
Processors with what is termed pipelined architecture 
are already known for this purpose in the prior art . 
This means that the processor processes a plurality of 
instructions simultaneously, each instruction being in 
a different stage of processing. For example, one 
instruction is just being executed, the next is 
simultaneously already decoded, the next but one has 
been requested from the memory, etc. 

25 It is possible, in particular, in such a pipelined 
architecture for a conditional jump instruction 
(branch) to lead to what is termed a harzard, as a 
result of which it is even possible for wrong results 
to be produced. Specifically, in the case of a 

30 conditional jump instruction, the address of the next 
instruction is not fixed until after processing of this 
conditional jump instruction. In this way, therefore, 
the next instruction can be requested from the memory 
and decoded only once the result of the execution of 

35 the preceding instruction is available from the 
arithmetic-logic unit of the processor. 
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In accordance with the prior art, this hazard problem 
has been solved in such a way that, directly after the 
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— remain correct in any 

case. However, not as many processor cycles are thereby 
utilized as dummy instructions that need to be 
5 processed. 



M.J. MAHON ET AL . : ^ HEWLETT- PACKARD PRECISION 
ARCHITECTURE: THE PROCESSOR' HEWLETT PACKARD JOURNAL, 
HEWLETT-PACKARD CO. PALO ALTO, US, Vol. 37, No. 8, 

10 August 1, 1986 (1986-08-01), pages 4 - 22, XP000211314 
disclose, inter alia, that in the case of the execution 
of a branching instruction or jump instruction in a 
processor with pipeline processing of the instructions, 
a delay instruction is inserted following the jump 

15 instruction in order to permit the calculation of a 
jump destination address before a destination 
instruction of the jump is loaded, or the program flow 
runs further to the destination instruction. The delay 
instruction is not executed if the same is canceled by 

20 nullification by the immediately preceding jump 
instruction. In the case of nullification, an 
instruction that immediately follows a jump instruction 
is executed as NOP. All jump instructions have for this 
purpose a 1-bit nullification field that controls the 

25 nullification and thus the activation or deactivation 
of the delay instruction as a function of a jump 
instruction, in order to optimize the use of the delay 
instruction in jump instructions. 



30 It is therefore the object of the present invention to 
permit the processing of conditional jump instructions 
in a processor with pipelined architecture without so 
great a loss of processor cycles by dummy instructions. 

35 This object is achieved by means of a method as claimed 
in claim 1 and an apparatus as claimed in claim 3 . 
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According to the invention, this object is achieved by 
means of a method for processing conditional jump 
instructions in a processor with pipelined architecture 
in the case of which there are added to each 
instruction according to which a conditional jump is to 
be executed one or more additional bits that specify 
under which condition the conditional jump is to be 
executed. It is already possible in this way to 
establish earlier an instruction as to whether a branch 
is to be carried out or not. Consequently, an 
instruction which will be the next instruction after 
the conditional jump is already fixed earlier. It is 
therefore possible to establish the jump destination of 
a conditional jump instruction much earlier by means of 
this branch prediction in the instruction set. 

It is particularly preferred in this case that in 
addition the appropriate jump address is added to each 
instruction according to which a conditional jump is to 
be executed. In this way, not only is an instruction 
known earlier as to whether a conditional jump is to be 
carried out or not, but the corresponding new 
destination address is already known. The correct 
instruction can therefore already be requested from the 
main memory of the processor. 

Furthermore, it is preferably possible in addition to 
add to each instruction one or more bits that specify 
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The present invention is explained in more detail below 
with the aid of the drawings attached in the enclosure, 
in which: 

5 Fig. 1 shows the operating cycle of a processor with a 
two- stage pipeline; 

Fig. 2 shows the design of a 22 -bit long machine 
instruction according to the invention; 

10 

Fig. 3 shows the design of a 25 -bit long machine 
instruction according to the invention; 

Fig. 4 shows a schematic of an apparatus according to 
15 the invention for altering the program counter reading 
for the purpose of executing conditional jumps; 

Fig. 5 shows a further apparatus according to the 
invention for altering the program counter reading for 
2 0 the purpose of executing conditional jumps; 

Fig. 6 shows a schematic of the overall design of a 
processor with pipelined architecture for the purpose 
of executing conditional jump instructions with the 
25 branch prediction according to the invention; and 

Fig. 7 shows a detailed illustration of a processor 
with apparatuses for the branch prediction according to 
the invent ion . 

30 

The present invention proceeds from a pipelined 
architecture for a processor. This is described, for 
example, in the book entitled "Computer Organisation 
and Design" by Pattersen & Hennessy. 

35 

Put briefly, the pipelined architecture signifies the 
following : 
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Normally, each machine instruction is processed by a 
processor by means of the following operations: 

1. Instruction fetch 

2. Instruction decoding 

3 . Execution 

4 . Write back 

It is already known in the prior art to have these 
operations run partially in parallel, for example by 
executing an instruction precisely when the next 
instruction is already being decoded. This mode of 
procedure is illustrated in figure 1 for a two-stage 
pipeline . 

Thus, a processor uses the pipeline in order to process 
on average one instruction per processor cycle. 

However, this pipelined architecture of the processor 
20 leads to problems whenever conditional jump 
instructions are to be executed. This problem is termed 
"branch harzard" in technical language. This means that 
a branch instruction, that is to say a conditional jump 
instruction, can show whether the next instruction is 
2 5 to be further processed or a jump is to be made to 
another destination address only after execution of the 
preceding instruction. 



15 



30 



This problem is solved in the prior art by filling the 
clock pulse after the conditional jump instruction with 
a no-operation instruction, that is to say an 
instruction to wait for a processor cycle. Although it 
is certainly ensured in any event that the program 
continues to run correctly, one processor cycle is 
35 lost, and thus so is the maximum possible computer 
power. The prior art is to be explained in more detail 
with the aid of the following examples, which 
respectively treat the calculation of the absolute 
value of a number: 
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Firstly, there is the possibility of conditional 
execution, that is to say, for example: 



10 



15 



20 



/* A = |B| */ 

LOAD Rl B 

COMPARE Rl 0 /*if B > 0, carry = 0 */ 
NEGATIVE Rl on-carry /* negate if carry = i */ 
STORE Rl A 

This type of execution is possible, however, only if 
only a single instruction must be executed 
conditionally, and this instruction includes no jump. 
In the case of more complex functions or tasks that can 
no longer be represented only by one instruction, a 
conditional jump must respectively be performed, as 
illustrated in the following program. As may be seen 
from the boxed program section, a no-operation 
instruction must be inserted downstream of the two jump 
instructions (in the case of a two-stage pipeline, and 
with longer pipelines correspondingly more no-operation 
instructions : 



LOAD 


Rl 


B 


COMPARE 


Rl 


0 


JUMP ON 


CARRY 


LI 


JUMP 


L2 




NO OP 







LI : 
L2 : 



NEGATIVE 
STORE 



Rl 

Rl A 



25 Finally, there is also the option in the prior art of 
what is termed speculative execution. This means that 
only one option is executed in the hope of hitting the 
correct continuation with a probability of somewhat 
more than 50%. However, this requires a very 

3 0 considerable outlay on hardware, since it is then 
necessary, after all, for some instructions to be 
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"^unraveled" should the supposition not apply. Moreover, 
processor cycles are lost nevertheless if there has 
been a "mis -estimation" . 

5 In accordance with the prior art, there has thus so far 
not been any suitable solution to this problem, that 
such a branch harzard, that is to say a problem with 
the conditional branching, has effected a loss in 
working cycles of the processor in pipelined 
10 architecture. According to the invention, this problem 
is now solved by means of a combination of an 
instruction with '"conditional execution" and a '"jump 
arithmetic" instruction, as follows: 

15 The aim here, once again, is to consider a simple 
example, specifically the instruction "Add R2 to Rl, if 
Rl then = 0, jump to LI". This problem is written as 
follows in '"C" : 

20 Rl = Rl + R2 
if (Rl = = 0) 
GO TO LI 
LI: 

25 According to the invention, the machine instruction ADD 
Rl, R2, #JMP, ON ZERO is used for this purpose. #JMP 
means in this case the relative jump address to the 
entry point LI. 

3 0 We thereby extend the instruction once by a post- 
condition to the known pre-condition. For example: PI, 
ADD Rl, R2, #JMP, Ql . 

Here, PI means: execute R1=R1+R2 if PI is fulfilled. 
3 5 According to the invention, Ql means: execute jump by 
JMP if Ql is fulfilled after the calculation of 
R1=R1+R2 . 
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It would thereby be possible for the following ^^C" 
program: 

if (A=l) 
^5 B = A; 

else 
C = A; 

to be translated into machine code as follows: 

10 

LOAD 

Ql TEST 
PI Ql STORE 
STORE 

15 

Thus, according to the invention, it is possible to 
provide in the instruction coding both bits for 
preconditions and bits for post -conditions, as is 
illustrated by way of example in figs. 2 and 3. 

20 

Fig. 2 shows in this case a simplified example with an 
instruction of only 22 bits in length, one bit 1 being 
provided for the precondition, one bit 2 for the post- 
conditions, 8 bits 3 to 10 for the relative 
25 displacement and then, as usual, three bits each for 
the two register addresses and 6 bits for the 
instruction code. 

It is usually necessary in reality to check a plurality 
30 of conditions as precondition and post -condition . 
Consequently, correspondingly more bits must be 
provided, as is illustrated in fig. 3. 

In fig. 3, the bits 0 to 1 contain the information for 
35 post-condition, the bits 2 and 3 information for pre- 
conditions, and the bits 4 to 10 the relative jump 
address, that is to say the jump distance. 



Rl A 

Rl 1 # L /* if A=l jump L */ 

Rl B 

Rl C 
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The method according to the invention can be used with 
particular effect in conjunction with a program loop, 
for example for the following "C" program: 

5 for (i=l; i < 5; i++) { 

X [i] = i; 
} /* C program */ 

According to the invention, this can then be converted 

10 into the following substantially simplified machine 
program : 

Load Rl 5 

Load R2 X /*Address of X[5]*; 

15 

Ll= STORE_INDEXED R2 Rl /* x[i] = i */ 

Ql DECREMENT Rl #1 LI 

ADD R2 1 /*i = i+1*/ 

20 The post-condition Ql means: conditional jump if the 
result R1=R1-1 is not 0. 

A further example of the simplifications that can be 
achieved according to the invention when programming is 
25 the program, represented below, for processing a ring 
buffer. 

In accordance with the prior art, this problem would 
have had to be programmed as follows: 

30 

TST (R3) #buffer_end // ring buffer end reached 

BNZ NEXT // if no 

NOP 

LDI (R3) #buf fer_start // else set the pointer to 

buffer again 



According to the invention, the following two 
instructions suffice instead of this: 
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TST (R3) 
LDI (R3) 



#buffer start 



#buffer end 



However, it is to be borne in mind that this solution 
5 according to the invention cannot be applied to all 
loop structures. Loop structures of all sorts can, 
however, be programmed as follows according to the 
invention: 

0 LDI (R4) #loop_cnt_minus_l // init loop counter 

WHILE LOOP: 



According to the invention, instead of the usual 
subtraction machine instruction SUB, use is made of a 

20 machine instruction SUBI that is extended such that it 
has a flag bit that is used for the purpose of 
indicating a cycle before the conditional jump 
instruction BNZ, which is the correct branch in the 
case of the conditional jump, such that no loss of 

25 processor cycles occurs at all in the case of a two- 
stage pipeline. Instruction LDI indicates a loop start. 

The typical solution for avoiding the branch harzard is 
based on predicting the expected jump destination of 
30 the conditional jump. 

The implementation of a loop generally requires three 
steps : 

35 1. Initialize the loop counter 

2 . Decrement or increment the loop counter 

3 . Jump to the end of the loop 



15 



FIRST_PC // code sequency 

SUBI (R4) #1 #loop_flag // decrement by 1 and 

indicate loop end 
BNZ WHILE_LOOP // if not zero go to 

loop begin 
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The cycle loss in the case of the conditional jump is 
based on the fact that the next instruction that is 
executed after the jump depends on the fulfillment of 
the loop condition. Consequently, dummy instruction NOP 
5 must be inserted after the conditional jump 
instruction. By using a loop flag in an arithmetic 
instruction such as ADD or SUB, the loop condition can 
be checked at the end of the execution of the addition 
or subtraction instruction. The zero flag, that is to 

10 say the indication by the arithmetic-logic unit that it 
is at 0, can then be checked in order to decide to 
which address the program counter of the processor 
should be set. This LOOP flag can be interpreted as an 
ENABLE-DISABLE flag or, more generally, as an address 

15 displacement. 

Figure 4 shows the simplest basic principle for 
implementing a LOOP flag in accordance with the 
invention. 

20 

The program memory 10 is connected in this case to the 
program counter 14 via a multiplexer 12. The output of 
the program counter (PC) 14 is connected to a logic 
gate 16 that combines the output value of the program 

25 counter with a constant or the LOOP flag. The output of 
this logic circuit 16 is connected to the input of the 
multiplexer (MUX) 12, whose other input is connected, 
after all, to the program memory 10, and whose output 
is connected to the program counter 14. The multiplexer 

30 12 is controlled by a control signal (control) from the 
processor. 

A further improvement to the invention permits 
dispensing with the jump instruction by buffering the 
3 5 start of the loop: 

LDP (R4) #loop_cnt_minus_l 
WHILE_LOOP: 

FIRST PC 
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SUBI {R4) #1 #Loop-flag 
NEXT_INS : 

There is a need here for an additional instruction LDP 
indicating that a loop is starting. The next program 
code address is then buffered as loop start. The same 
result can also be achieved by using the instruction 
LDI and loading the next program counter value 
explicitly into the buffer. Of course, an additional 
instruction is, however, required again thereby. The 
instruction SUBI has a loop flag that is used to 
indicate which is the correct branch for the 
conditional jump. The zero flag is checked in order to 
decide whether the aim is to jump back to the start of 
the loop or to execute the next instruction (NEXT__INS) 
that is indicated by #-Loop-Flag. 

This simplified processing of loop structures requires 
a somewhat more complicated structure of the circuit 
according to the invention, as illustrated in fig. 5. 

As in fig. 4, here, as well, a program memory 10 is 
provided that is connected to the input of a 
multiplexer 12 whose output is connected, in turn, to 
the program counter (PC) 14. The output of the program 
counter (PC) 14 is likewise connected to a logic gate 
16 that combines the output value of the program 
counter with the loop flag. The output of this logic 
circuit 16 is connected to a further input of the 
multiplexer (MUX) 12. However, in the present case, the 
multiplexer 12 has a further input, which is connected 
to a buffer 18 whose input can be loaded with the value 
of the program counter 14. The explicit instruction 
"Load the next program counter reading into the buffer" 
becomes superfluous in this way. 

Fig. 6 shows the overall design of a processor with the 
ability to process the instructions according to the 
invention. Elements identical to those in figs. 4 and 5 
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are also provided with identical reference symbols. The 
program counter (PC) 14 once again accesses the program 
code memory 10, and respectively accesses the program 
line to be processed in this case. The corresponding 
5 instruction code is fed from the program memory 10 to 
the instruction decoder (IDEC) 20. The latter relays 
the corresponding control instructions to the 
arithmetic-logic unit (ALU) 22 and to the register set 
24. The contents of the registers are then loaded if 

10 needed into the arithmetic-logic unit 22, or written 
back from there again, as indicated by the arrows. The 
flag signals zero, carry and overflow of the 
arithmetic- logic unit 22 are simultaneously fed both to 
the instruction decoder (IDEC) 2 0 and to the control 

15 input of the multiplexer (MUX) 12. The two inputs of 
the multiplexer 12 are occupied by the value 1 and by 
the relative jump value #JMP supplied by the 
instruction decoder 20. The output of the multiplexer 
12 is connected to an adder 16, whose other input is 

2 0 connected to the output of the program counter 14. 

It is to be borne in mind in the case of more than two 
pipeline stages that the flag signals zero, carry, 
overflow and the associated relative jump value #JMP 

25 must simultaneously be present at the multiplexer 12. 
However, this is not necessary in the case of a two- 
stage pipeline as described in the present exemplary 
embodiment. The corresponding instruction coding with 
the post-condition according to the invention is now 

30 described below. Reference may be made once again for 
this purpose to fig. 2, which illustrates the simplest 
possible instruction set according to the invention 
with a length of 22 bits. 

35 The uppermost 6 bits (21 to 16) in this case contain 
the instruction code (OPCODE), for example: addition. 
The next three bits contain the address of the first 
register (REG A) with a length of three bits 
(conventional processors mostly use no more than 8 
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registers) on the bits 15, 14, 13, this being followed 
by the register address of the second, to be added in 
this case, register (REG B) on the bits 12, 11 and 10. 

5 In the case of this instruction, the arithmetic-logic 
unit of the processor will therefore add the contents 
of the registers A and B and store them in the 
register. According to the invention, other bits are 
now added to this instruction, specifically the bits 9 
10 to 2 (displacement) , which specify the relative jump 
distance in the case of a following conditional jump. 
The condition bits 1 and 0 then follow, the bit 1 
(post) indicating the post-condition, while the bit 0 
(PRE) indicates the precondition. 

15 

The processing cycle is now as follows in this case: 
the instruction must be fetched and decoded. For this 
purpose, the processor starts at a specific program 
counter reading, for example PC=0 . 

20 

Given this program counter reading, an instruction of 
22 bits is fetched from the program memory, which is 
present at the address in the memory corresponding to 
this program counter reading. 

25 

The instruction is then processed by the instruction 
decoder (IDEC) 20. 

An attempt is first to be made in this case to check 
30 whether the corresponding precondition bit is set. If 
this is the case, the instruction is not even executed 
given the non-fulfillment of the corresponding 
precondition. 

35 The difference between the present invention and the 
prior art resides in the post -condition bits. 
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The signal -BR_CTR" is generated from this post- 
condition bits. The addition is carried out as follows, 
at the same time: 

5 A control signal ALU-CTR and the read and write 
addresses and enable signals for the arithmetic-logic 
unit are generated. At the same time, the instruction 
decoder 20 makes the relative jump distance "BR" 
available. The "BR-CTR" signal drives the branch 
10 control according to the following prescriptions: 

1. No jump if post-condition bit = 0, that is to say 
PCNEW=PCOLD+l 

15 2. A relative jump is executed if the post -condition 
bit = 1 and the condition is fulfilled that, for 
example, zero flag = l. The program counter 14 is 
therefore set to the new value PCNEW=PCOLD+BR. 

2 0 If the post -condition bit is = 1, but the condition is 
not fulfilled, again no jump is carried out, that is to 
say : PCNEW=PCOLD+ 1 . 

It is possible to use more than one post-condition bit, 
25 as illustrated, for example, in fig. 3. It is then 
possible to check more conditions (for example zero, 
carry, overflow) . 

Thus, according to the invention, control information 
30 for the arithmetic- logic unit and information relating 
to jump destination addresses is simultaneously made 
available for the first time simultaneously by the 
instruction decoder 2 0 during decoding of the 
instructions . 

35 

The instruction is now executed and, if appropriate, 
branched. 
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The action of the arithmetic- logic unit (ALU) is 
executed for this purpose. The result is written back 
into the appropriate register. At the same time, the 
corresponding zero, carry, etc. flags are present at 
5 the output of the arithmetic-logic unit. 

The bits for the individual flags, "BRCTR" and the 
value "BR" relating to the same clock pulse are made 
available in this case to the branching control. As 
10 illustrated in fig. 7, the control unit "Cond" 26 then 
generates two control signals SI and S2 . SI prompts 
either to undertake no jump, or to work out a relative 
jump. S2 then switches through the relative jump 
address «PCNEW" through the multiplexer 12. 

15 

An additional instruction is thereby saved for the jump 
in addition to the corresponding arithmetic 
instruction. Consequently, a reduction can be achieved 
in the number of required instructions, and the 
2 0 throughput of the processor is raised thereby. 

The design of a processor for processing instructions 
with the post-condition bits according to the invention 
is illustrated in detail in fig. 7. Identical numerals 
25 to those in figs. 4, 5 and 6 refer to identical units. 

Also provided in fig. 7 is a program counter 14, which 
addresses an instruction memory (CODEROM) 10. From 
there, the instructions with an instruction width of 22 

30 bits are fed to the instruction decoder (IDEC) 20. The 
latter generates the usual signals for driving the 
register 24 and the arithmetic-logic unit (ALU) 22. 
However, according to the invention it also generates 
in addition the signals "BR" (this signal comprises a 

35 plurality of bits) and specifies the relative jump 
distance as well as the signal "BR-CTR", which 
specifies that a conditional jump is to be processed 
and the corresponding flag bits of the arithmetic-logic 
unit are to be checked. 



wo 01/22217 - 17 - PCT/EPOO/09267 

At its output, the arithmetic- logic unit 22 supplies 
results and the corresponding flags that illustrate 
specific conditions (for example, zero, overflow, 
5 carry, etc.). The results can, of course, also be fed 
to the registers 24 again. The "BR_CTR" signals and the 
flags from the ALU are fed to a further logic unit 
(Cond) 26. This generates as a function of the 
corresponding BR_CTR signals and the associated flags 

10 signals SI and S2 that control the multiplexer 12 and a 
switch upstream of the one input of the adder 16. This 
switch switches over between 1 and ''BR" as a function 
of the fulfillment of the flag conditions. The other 
input of this adder is connected to the output of the 

15 program counter 14. 

It is possible in this way according to the invention 
to carry out a substantially faster processing of 
conditional jumps with a relatively low additional 
2 0 technical outlay at the processor. 
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(57) Abstract: The invention relates to a method and a device for processing conditional jump instructions in a processor with 
pipelined architecture. One or more additional bits indicating the condition under which the conditional jump instruction is to be 
fS| executed is/are added to each instruction stating that a conditional jump is to be executed. The inventive device can also comprise a 
device for altering the count of the program counter according to the additional bits for executing the conditional jumps. 

2 (57) Zusammenfassung: Veifahren und Vorrichtung zur Bearbeitung bedingter Spnmgbefehle in einem Prozessor mit "Pipeli- 
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O hinzugefugt werden, die angeben, unter welcher Bedingung der bedingte Sprung auszufuhren isL Zusatzlich kann die Vorrichtung 
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